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Abstract — We address the problem of downlink beamforming 
for mitigating the co-channel interference in multi-cell OFDMA 
networks. Based on the network utility maximization frame- 
work, we formulate the problem as a non-convex optimization 
problem subject to the per-cell power constraints, in which a 
general utility function of SINR is used to characterize the 
network performance. Some classical utility functions, such as 
the proportional fairness utility, the weighted sum-rate utility 
and the a-faimess utility, are subsumed as special cases of 
our formulation. To solve the problem in a distributed fashion, 
we devise an algorithm based on the non-cooperative game 
with pricing mechanism. We give a sufficient condition for the 
convergence of the algorithm to the Nash equilibrium (NE), 
and analyze the information exchange overhead among the base 
stations. Moreover, to speed up the optimization of the beam- 
vectors at each cell, we derive an efficient algorithm to solve for 
the KKT conditions at each cell. We provide extensive simulation 
results to demonstrate that the proposed distributed multi-cell 
beamforming algorithm converges to an NE point in just a few 
iterations with low information exchange overhead. Moreover, 
it provides significant performance gains, especially under the 
strong interference scenario, in comparison with several existing 
multi-cell interference mitigation schemes, such as the distributed 
interference alignment method. 

Index Terms — Multi-cell, downlink beamforming, distributed 
algorithm, game theory, pricing mechanism, utility optimization, 
dual decomposition. 



I. Introduction 

IN multi-cell wireless networks, besides the intra-cell in- 
terference caused by spatial multiplexing within each cell, 
another impediment arises from inter-cell interference due to 
the ever-shrinking cell sizes. Alleviating the effects of inter- 
cell interference requires the base stations (BSs) to adjust their 
transmission schemes collectively. In fact, inter-cell interfer- 
ence mitigation has been identified as a key issue for future 
wireless networks. In particular, for downlink transmissions, if 
the inter-cell interference is mitigated via coordinated process- 
ing across multiple BSs, significant performance gains can be 
possibly obtained, especially for the users at the cell edges. 
Therefore, recently, there has been a rapidly growing interest in 
shifting the design paradigm from the conventional single-cell 
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to the cooperative multi-cell networks fT\. Various methods, 
such as [2J, |i3J, [4|, have been proposed to provide network- 
wide, macroscopic cooperation among different BSs. In these 
studies, it is assumed that the BSs in a multi-cell network 
are connected via backhaul links to a central processing unit, 
which has the global knowledge of the transmitted data from 
all the users in the network and the downlink channels from 
each BS to all the users. Such a fully coordinated case is 
sometimes referred to as networked MIMO. However, for 
large and dense networks, networked MIMO obviously in- 
curs a substantial infrastructural and computational overhead, 
which increases the system costs and hinders the practical 
implementations. This motivates the problem of constrained 
cooperation, taking into account many practical factors, e.g., 
limited backhaul capacity Q, local cooperation |6 |, processing 
complexity and delay fT^, imperfect channel state information 
(CSI) II 11, and feedback errors |101. 

On the other hand, future cellular networks are envisioned to 
be distributed systems with autonomous and self-coordinated 
cells. Each BS can make independent and rational decisions 
in a decentralized manner, with limited information exchange 
with the neighboring BSs. This motivates the study of dis- 
tributed multi-cell interference mitigation, which requires only 
the local and neighboring CSI at each BS, without the need of 
a central controller, and is therefore much easier to implement. 
Based on a generalization of uplink-downlink duality to the 
multi-cell setting, an iterative algorithm is proposed in 11 111 to 
optimally solve the multicell downlink beamforming problem 
for minimizing either the total weighted transmit power or 
the maximum per-antenna power subject to the SINR con- 
straints. An alternative to the transmit power minimization 
problem is the rate maximization problem subject to the power 
constraints, which is in general non-convex. An approach 
based on the concept of virtual SINR is proposed in |12]. In 
flSl an iterative algorithm is developed for solving the KKT 
conditions of the weighted sum-rate maximization problem 
subject to per-cell power constraints. However, the proof of 
convergence is still an open problem. Other related works 
include i l4| . which explores the relationship between the 
MISO interference channel and the cognitive radio MISO 
channel to devise rate-optimal strategies for decentralized 
multi-cell cooperative beamforming. 

Game theory provides a systematic mathematical framework 
for the study of competition and cooperation among intelligent 
and rational decision makers. There has been a significant 
amount of recent research that applies game theory to resource 
allocation problems in wireless networks ITSl - ifTol .In general, 
game models can be classified into two main categories: non- 
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cooperative and cooperative games. Although non-cooperative 
game is a useful tool to devise totally distributed algorithms, 
the Nash equilibrium (NE) of the non-cooperative game may 
suffer a significant performance degradation compared with 
the optimal centralized solution. On the other hand, the coop- 
erative game approach offers performance gain over the non- 
cooperative game, but it requires extensive message exchanges 
among all players, which implies a large communication 
overhead and poor scalability when applied to large networks. 
The pricing mechanism is another alternative to overcome the 
inefficiency of the non-cooperative game approach. In this 
paper, we develop a provably convergent distributed multi- 
cell beamforming technique based on the pricing-based non- 
cooperative game. 

In particular, we formulate the problem as a general net- 
work utility maximization problem subject to per-cell power 
constraints, which is a non-convex problem. Examples of the 
utility functions include the weighted sum-rate utility, the 
proportional fairness utility, and the a-fairness utility ||20| . etc. 
We treat each cell in the multi-cell network as a player, and 
design an efficient distributed pricing mechanism to optimize 
the network performance through coordination among the 
players. We give a sufficient condition for the convergence 
of the proposed distributed multi-cell beamforming algorithm. 
Moreover, we derive an efficient algorithm based on the dual 
decomposition technique for solving the KKT conditions of 
the downlink beamforming problem at each BS. The proposed 
technique can converge rapidly to the NE point with a low 
information exchange overhead among the BSs. It provides 
significant performance gains, especially in the strong interfer- 
ence scenario, in comparison with several existing approaches, 
including the recently proposed distributed interference align- 
ment method II2TI . 

The remainder of this paper is organized as follows. In 
Section 11 we introduce the system model and the problem 
formulation. In Section 111 we develop the pricing-based mul- 
ticell distributed downlink beamforming technique. In Section 
IV, we derive the beamforming optimization algorithm at each 
cell based on dual decomposition. Simulation results are given 
in Section V. Finally Section VI concludes the paper. 

II. System Model and Problem Formation 

We consider a downlink multi-cellular network where a 
set of BSs A4 = {1,2,..., A/} simultaneously transmit on 
the orthogonal sub-channel^ J\f = {1,2,..., N} during each 
scheduling interval. Each BS m G is equipped with T 
transmit antennas and space-division multiple-access (SDMA) 
is employed to serve multiple single-antenna mobile users on 
each sub-channel. Let Bm^ be the set of users scheduled by BS 
m G on sub-channel n G JV. For simplicity and without 
loss of generality, we assume that \Bin^\ = Q,Vm,Vn. We 
further assume that each user is served by only one BS. 

For data transmission, BS m on sub-channel n transmits 



complex symbols b[ 



in) 



through T transmit antennas 



'The sub-channel refers to a logical collection of physical sub-carriers, 
which is regarded as the minimum granularity of the radio resource allocation 
unit in this paper 
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assume that EUbZ'J^} = 1, and E{5|;\^i^6L":ij = 0, for 
(ni,TOi,fci) 7^ (n2, 7712,^2), where E{-} is the expectation 
operator. Then after normalized by the noise standard devia- 
tion, the received signal by user k G ' on sub-channel n 
can be written as 
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is the complex channel vector between BS 

m and user k G Bm^ on sub-channel n, z^^\ ~ A/c(0,l) 
denotes the circularly symmetric complex Gaussian noise 
sample, and ^is the Hermitian transpose operator. 

The SINR for user k G Bm"^ on sub-channel n can then be 



expressed a^ 
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where the terms and 1^^°"^ account for the in-cell 

and out-cell interference, respectively. 

Now, we consider the following general linear beamforming 
optimization problem where we wish to maximize a network- 
wide utility function across all users of all coordinated BSs 
and all sub-channels, by choosing the set of beam-vectors 
W = {W^'^-'jn G TV}, subject to the per-base-station power 
constraints: 

m^X C/network(W) = ^ E E ' 

meM n£j\f 



S.t. 



E E 



(4) 



where P,„ is the total transmit power at BS m. We assume that 
the above optimization problem has a set of feasible solutions, 
which can be facilitated through some form of admission 
control or/and scheduling strategies. 

In the above formulation, each user k G Bm'' is assigned 
a utility function C^/^'fe(r^\,), which is assumed to be a 
monotonically nondecreasing, concave and twice differentiable 
function of the received SINR r|^''^. Typical utility functions 
include the following: 

^We drop the expHcit dependency of f'"^, and T^"\ on W*^"' . 

1^ f f J m.fc m.k 
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. Proportional fairness utility ||20l: U(r) = log(r); 
. Rate utility: U{T) = log(l + T); 

. a-fairness utility L20J: C/(r) = (l-a)-i(r)i-", a =^ 1. 

Note that the constraint set in (|4| is convex. However, 
due to the SINR expression (O, even though the utility 
function C/,l"l,(r^"\.) is concave in terms of the SINR r,^"\., 
it is in general nonconcave in terms of the set of beam- 
vectors W("). Numerically finding the global optimal solution 
to the optimization problem (|4|i is known to be a difficult 
problem.Our objective is to develop a distributed solution to 
(|4|i where each BS updates its beam-vectors locally; and with 
the aid of limited information exchange among the BSs, some 
form of optimality can be achieved. To that end, we resort to 
the game theoretical tool of pricing mechanism. 

III. Pricing Mechanism and Distributed 
Algorithm for Non-Cooperative Beamforming 
Game 

An extreme example of distributed beamforming scheme 
is for each BS to independently update its own beam-vectors 
without considering the actions of other BSs. However, such a 
pure non-cooperative approach may result in non-convergence 
or some undesirable Nash equilibrium (NE) with low individ- 
ual as well as system- wise performance |22|. For instance, it 
is shown in ||23]| that for a two-user MISO system, the NE 
point achieved through the pure non-cooperative game over 
all possible choices of beams is far away from the Pareto 
boundary of the achievable rate region. 

The pricing mechanism r^^^-fSSl has been employed as an 
effective means to stimulate cooperation among players, and to 
guide the players' behaviors toward a more efficient NE that 
improves the system performance, by introducing a certain 
degree of coordination in a non-cooperative game. In this sec- 
tion, we propose a pricing mechanism for the non-cooperative 
multicell beamforming game and the corresponding distributed 
beamforming algorithm. We then prove the convergence of 
this algorithm. Finally we analyze the information exchange 
overhead among the BSs. 

A. Pricing Mechanism 

We model the pricing-based non-cooperative multicell 
beamforming game as 

G — {M, [WmjmeM, {UmjmeM}, 

where the elements are 

• Player set: M ^ {1,2, . . . , M}, i.e., the set of BSs. 

• Strategy set: {Wi, . . . , Wm}, where the strategy set of 
player (BS) m is the following 

(5) 



Payoff functions set: {Ui, . . . , Um}, with 
C/™(W™,W_„) 

= E E c^S(rL"i)-t^(W™,w_„,), (6) 



where W„ 



An) 



7n.k' 



k e Bln>,n G JV} and 
W_™ = {Wi,...,W,„_i,W,„+i,...Wm} denote the 
set of beam-vectors of BS m, and that of all other BSs, 
respectively. C(W,„, W_„i) is a cost function associated 
with a pricing mechanism. 
An efficient pricing mechanism should take into account 
the nature of the service requirement of each player and reflect 
accurately the cost of resource consumption for fuUfiUing each 
player's requirement. Inspired by [24J-[26|, we will apply the 
usage-based pricing mechanism to solve our problem, where 
the price a player pays for using the resource is proportional 
to the amount of resource consumed by the player. 

First, we introduce a quantity called the interference pricing 
rate of user k G Brn \ which measures the marginal decrease 
in utility due to a marginal increase in interference, given by 



(n) A _ 
"m.k ^ 



where (C/^].)' denote the derivative of the utiUty function with 

' (n) 

respect to the SINR j,. When BS m transmits signal to 
user k G ' on sub-channel n using the beam-vector w,^"'' 
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it induces the interference 
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(n) 
m,fc 



to all other users 



u^B'"^\{j,u) 7^ {■m,k),j G M. Thus, under the pricing 



mechanism, when serving user k ^ B, 
a total cost: 

r^i fill 2 



(") 



, BS m needs to pay 



E 



(n 



m.k 



^(") (") 
^m.k m^k^ 



m.k^ 



(8) 



where L|^"'j, is defined in (|9]l. We called Tj^^ ^. as the leakage 

matrix of user k G ' on sub-channel n, which accounts for 
the amount of interference caused by BS m to other co-channel 

(n) 

users on sub-channel n when serving user k G B'm'. Note 
that L^-*^ is Hermitian symmetric, i.e., 



t(») 

L^.fe, smce 



■ (n).out 



m.k 



and 



is Hermitian symmetric. The terms L 

in (|9]l account for the in-cell and out-cell leakages, 
respectively. 

Hence summing across all users served by BS m and across 
all sub-channels, BS m needs to pay a total cost of 



(10) 



Summarizing the discussion above, in the pricing-based 
non-cooperative multicell beamforming game, each BS m 
solves the following optimization problem 



max > 

keB 



E (^m,fc ij'rn.kj 



W, 



(n) 



T (") ^(") ^ 

k m.k m.k j 



s.t. 



E 



(" 



^m.k^m.k ^ 



(11) 



keB 



Notice that the objective function is still nonconcave with 
respect to the beam-vectors W„j associated with BS m; 
thus the globally optimal solution to (fTTl l cannot be found. 
In Section |IV] we drive a dual decomposition algorithm for 
obtaining the solution to the KKT conditions of ( fTTb . 
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B. Distributed Multicell Beamforming Algorithm 

We propose the following distributed algorithm for imple- 
menting the pricing-based non-cooperative multicell beam- 
forming game. 

Algorithm 1: Distributed multicell beamforming algo- 
rithm. 

1 Initialization: 

2 Each BS TO initializes W™ satisfying the power 
constraint. 

3 Repeat 

4 For m = I : M 

5 BS m obtains a solution W„i to ( fTTT ) for given 

6 W_m, using Algorithm 2. 

7 If f7™(w™, w_„o > c7„(w„, w_„o 

8 Then 
' { 

10 BS TO updates its beam-vectors as W^. 

11 Based on the new beam-vectors W.,„, BS m 

12 updates 

f-(n),out,m 



{I. 



m, n e TV}, 



and fc e 



13 
14 

15 according to ^ and (|7]i respectively, and 

16 passes them to BSs j G A4\m. 

17 } 

18 Endlf 

19 EndFor 

20 Until convergence 

We have the following observations on Algorithm [T] 

1) Only one BS updates its beam-vectors at a time, based 
on the latest out-cell interference powers and interfer- 
ence price rates (and thus the latest out-cell leakage 
matrices) from every other BS in the multicell network. 
Moreover, after a BS updates its beam-vectors, the new 
out-cell interference powers and new interference price 
rates are announced timely to every other BS. 

2) Only if C7„,(W„„W_„,) > ^™(W™,W_,„) holds, 
BS TO updates its beam-vectors as W„i. Otherwise, BS 
TO keeps its old beam-vectors. This method is based on 
the better response strategy in game theory, which refers 
to an update procedure where the players choose actions 
that increase their utilities as opposed to maximizing 
their utilities in the best response strategy. Notice that 
the best response strategy cannot applied due to the 
nonconvexity of (fTTI) . 

These features ensure the convergence of the algorithm, as 
discussed next. 



C. Existence and Convergence of NE 

The Nash equilibrium (NE) is a well-known concept for an- 
alyzing a game. A set of beam- vectors W* — (W* , . . . , W^j) 
is an NE if, for every BS to G TW, Um{W*^,W%^) > 
Um(Wm, W* ,„), yWm £ Wm- That is, given the other BSs' 
beam-vectors, no BS can increase its utility unilaterally by 
changing its own beam-vectors. For the multicell beamforming 
game under consideration, the existence and convergence of 
NE is heavily dependent on the concavity of the utility 
function u'^^^. We first introduce a quantity that measures 
the relative concavity of a utility function. Specifically, the 
coefficient of relative risk aversion associated with the utility 
function [/(F) is defined as 



«(r) = - 



F • [/(F)'' 

u{ry 



(12) 



where C/(F)' and [/(F)" denote the first- and second-order 
derivatives, respectively. 

We have the following result on a sufficient condition for 
the convergence of Algorithm 1. 

(n) 

Proposition 1: Suppose that the utility function 'f, sat- 
isfies 

< K^lk < 2, Vfc e Bl"J, meM, neJV, 



then Algorithm 1 converges to an NE point. 
Proof: See Appendix A. 



□ 



Remark: The condition < k^^^ ^ < 2 can be interpreted as 
requiring that the utility function to be sufficiently concave, 
but not too concave. If the utility function is too concave (i.e., 
'^m k ^ 2), the updates may be too aggressive to guarantee 
convergence. Fortunately, this condition is satisfied by most 
utility functions of interest, as discussed below. 

1) For the proportional fairness utility function C/(F) = 
log2(r), its coefficient of relative risk aversion is k = 1. 

j'p\ 1 — a 

2) For the a-fairness utility function C/(F) = ^ with 

a 7^ 1, we have k^'''^ = a. Hence, for < a < 2 (a 7^ 
1), we have < k < 2 (k 7^ 1). 

3) For the weighted sum-rate utility U{T) = a;log2(l + 
or), with < 6* < 1, we have the following. 

• 0=1 corresponds to the Shannon rate with < 

K = — < 1. 

* < 9 < 1 corresponds to the achievable 
rate for some practical modulations, where 9 = 
"TSic^W)' '?^i''^2 are constants depending 
on the modulation and BER is the required bit-error 



rate. We have < k 



er 

i+er 



< 1. 
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Computing 




Computing 



j(n),outj| 



Computing 



User u 



{7rj"j,w e B'f'}, BS 3 sends them to BS m. 
Remark: Note that compared with II13I . in our scheme, BS 



(n),out ,j 



itself to BS 



j sends {tt^'^, w G } instead of L„ 

TO. Since Lm'''°"'^ is a T x T complex-valued matrix, while 
{tt^'^^m e Bj""*} is a Q X 1 real-valued vector, and typically 
Q <T, our scheme incurs a much lower information exchange 
overhead. 



Fig. 1. Local computation at eacli BS and tiie information exciiange among 
the BSs. 



D. Information Exchange Overhead 

(n) 

In order to compute the SINR \. and the leakage matrix 

-'-'m i'^ (fTTT i. BS TO needs to get certain information from the 
neighboring BSs. In Fig.[T] we give a graphic illustration of the 
local computation at each BS and the information exchange 
among the BSs, which is further elaborated as follows 

BS TO can compute 1^'^, in ©-([Sli for each user k E B 
on each sub-channel n £ N through the following. 



(") 



I ) Computing the 



(n) 

711 ^k' 



first 

2 



term 



r(n), 



in (O. BS TO only needs the 



knowledge of the direct downlink channels h^"^^. and of 



the local beam-vectors {w^''^,,, k' e Bm'\k} computed 
at the previous iteration. Thus, no information exchange 
is needed. 



2) Computing 
needs the 



the second term 



r(n),out ,j 
''m.k 



h^(n) (n) 



local beam-vectors 



j e M\m in BS j e M\r 



computed at the previous iteration, and the interference 
downlink channel h^",\ which is sent from user 



k 6 Bm to the serving BS to, and then from BS to to 



^(n),out ,j 



and then sends it to 



BS j. BS j calculates X; 
BS TO. 

BS TO can compute L^'''^, in ^ for each user k £ B, 
each sub-channel n E J\f through the following 



in) 



and 



I) Computing 



the 



first 



term 



'^ra^k 



k'^m k' ®- ^'^ ™ °"ly needs 



the direct downlink channels {h^''^,,, k' € Bm'\k} and 

the local interference pricing rates {Trj"-'^,, , k' G Bm \k} 
computed at the previous iteration. Thus no information 
exchange is needed. 



2) Computing the 



second 



term 



■ (n),out ,j 



E„^R(") 7T)"^tyi"W/:!u,j G X\to in ®. BS J needs 



the interference downlink channels {hm''ti, 



each of which is sent from user u to the serving 
BS j. Notice that these channels have already been 
sent when computing 2^('^)'°"t,™ Thus this incurs 
no additional information exchange. On the other 



hand, the interference pricing rates {tt 



(n) 



G S"'} 



can be computed locally according to (|7|i for which 
the quantity xj"^ is needed, which in turn has been 
computed at the previous iteration. After computing 



IV. Per-BS Beam-vector Update 
VIA Dual Decomposition 

In this section, we derive a dual decomposition algorithm 
for obtaining the solution to the KKT conditions of Problem 
(fTTT i at each BS, which is the key step in Algorithm [T] (line 5 
and 6). 

A. Dual Decomposition 

The dual decomposition technique is an effective method for 
decoupling the coupled constraints and performing distributed 
optimization. 

First, by introducing a set of scalar variables pm = 
^p:^^,k E Bm\n E N^, we can rewrite the optimization 
problem (fTTl i as follows: 



max > 



(^m,/c(rL,fc) ^\n,k^'rn,k^ 



m.k I ' 



fcGB<,"' 



B-t. E E p'£k < Pm, (13) 

^m,k^m,k ^ Pni,k' «^ ^ : n E JV . 

Notice that the optimization problem ( fTsT i has only one single 
coupled constraint E„eA^ Efce^c^) Pmi < -P™- Then, we 
form the Lagrangian (fT4] | of the optimization problem (fTsl i 
with respect to the coupled constraint. In (fl4] l. A„i denotes 
the Lagrangian dual variable. 
Define the dual problem as 



min (A„ 



= where the objective function Dm (A„i) is 



max L™(W„,p™,A„ 



<Cl "eA/-. 



(15) 



(16) 



Notice that the dual function Dm (Am) is the pointwise 
maximum of a family of affine functions of A„i, hence it is a 
convex function of A,™. 

B. Decoupled Subproblems 

First, we need to compute Dm (Am) for a fixed Am- Due 
to its separable structure, the dual function Dm can be 
decomposed into NQ subproblems D''^\,n E TV, fc G Bm'^ 
as follows: 



"1^^ , , f^m.fcl^ m,fc) ~ Wm,fc-^m,fcWm,fc ~ ^^Pm,k^ 

(17) 



.(") 



S.t. 



-(") (n) <; (n) 
^m,fe^m,fc ^ Pm,k- 
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(n) 



^(n) T (") (") 
m,k m,k m.k 



(14) 



The Lagrangian of the subproblem 'j^ is given by 

- ^m.,k m.k J ~ ^m.k^m.k^m,k ~ ^"iP^.k 

An) / -.(n) in) („) \ 

^^m,k \ ^m,k^m,k ^ Pm,k J ' 

(n) 

where v^ ^. is a dual variable associated with the constraint 

-.(n) (n) , (n) 

We next obtain the KKT conditions, given by 



dL 



in) 



(«) 



= U. 



n.k\ m.k } 



9T w'"' 



2-i"ilTW^:i ^ (19) 



dL 



in) 



= -Am + V, 



in) _ 



(20) 



C. Master Problem 

We need to solve the master problem (fTSl l on top of the A^Q 
subproblems. Since the master problem is convex in A,„, we 
will apply the subgradient method. Notice that where W*„ — 



{w 



(")* 

m.k ' ' 



G TV, k £ Bm^ } is the optimizer for ( fTsT i in the 



definition of (A™). 

Thus, we can set the subgradient of D„i (Am) as P„i — 
EnGA^Efegf5(;')-Pm,fc- The subgradient search suggests that 

we should increase Am if Pm < I]„ga/' SfceeL"' ^^mi*' 
decrease Am otherwise. Notice that the adjustment occurs in 
a one-dimensional space, thus a simple bisection method can 
be employed. 



D. The Beamforming Algorithm at Each BS 

Finally we summarize in Algorithm 2 the dual- 
decomposition-based beamforming algorithm at each 
BS. 



where indicates the T x T identity matrix. 

By combining the above two equations, we obtain 



rji(") (") 

m,fe m,/c' 



with T, 



in) 



. (n) 



Ami 



(21) 
(22) 



Solving w^"^j, from ( 1211 1. we obtain the expression for the 

' (n) 

beam-vectors associated with user k e Bm for a fixed Am, 
as follows. 

Proposition 2: For a fixed Am > 0, the solution to the KKT 
conditions of problem ( fTTl ) is of the following forrr0: 



m.k 

in)* 
Pm.k ' 



.(")tu(») [7 

■m.k'-'-m.ky y 



-^(n)* in)* 
m.k m.k 



1 



m.k J m.k m.k^ 
' m.k I m.k m.k' 



with ^a=inv{^::)( ^,^;;g^,, )), 

l/fhi 



T 



(n) 
m,k 

in) , 
m.k 



(n) rp(n)tv (n) 

?7i , m , A; rn , /e 



(")tu(n) 



m,A; m.k 



T 



(n) 
m.k' 



(23) 
(24) 
(25) 
(26) 
(27) 



Furthermore, if w^"',! 7^ 0, Am = is feasible only if 



Algorithm 2: An algorithm for updating the beam-vectors 
at each BS m. 

1 Initialize A™'" and A™^'^; 

2 Repeat 

3 Set Am ^ (Ar" + Af,r) /2; 
Repeat through fc = 1, Q; 1, Q; ... 



Compute and thus {IZK=i 

according to (O; 

(n) \N 



("):iniJV 
''m./i; 

according to (|9|l 

Compute {w::^)-,pS*}n= 
dig and dal. 
Until convergence 

Then set A™" ^ Am 
Else set A^^f ^ Am 
End If 



Compute {l^X K^, and thus {L^^^U^Li 



I^^Li for ( flTl l according to 



16 Until I A" 



A" 







Note that Algorithm |2] can also be viewed as an iterative 
procedure for solving the KKT conditions of problem ( fTTl ). 
which consists of the following. 

in) 



hm,fc G £'(Lm,fc)' where £'(L^_\) denotes the column span 1) The stationarity condition (EB for fc G Bin', n eAf. 



in) 
m.k' 



of the matrix L 
Proof: See Appendix B. □ 

denotes the pseudo-inverse; Inv{f7'} is tlie inverse function of U'. 



2) The sum-power constraint 



E E 



^m.k^m.k ^ 



(29) 
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> E E {c^^l-<tLawLt}-A™(E E ^Lt-^™) 

= A„ (A„0 - ( ^ ^ pI;;?* - Pr,^) (A,„ - A„) , (28) 



keB), 



3) The complementary slackness conditions: 



(E E 



(30) 



with Am > 0. 

Starts with a given Am, Algorithmic solves ( 1211 1 for fc G 
n G A/^, and then adjusts A™ according to the search direction 
suggested by the power constraint ( |29] |. such as to satisfy the 
complementary slackness conditions ( l30l l. 

Notice that since M = {1, 2, A^} is a set of orthogonal 
sub-channels, for ni,n2 G TV. ni ^ n2, and fci,fc2 G 
fci ^ ^2, we can update {w^_\.''^ , J for the subprob- 
lem D^"\^ , and {w^"^^ ,p^"\^ } for the' subproblem D^"'^ 
in parallel. Such a simultaneous update can improve the 
convergence speed of the iterative procedure, especially when 
N is large. 

Moreover, it is easy to derive the following KKT conditions 
of the original problem (|4|i: 



rrin) /^h^^W^' 

k G B^^\neU,me M 
ne^f /^k£B.,. 

Let W^^^ the beam-vector set satisfying the KKT conditions 
of (dB for BS m G A^. By comparing (EB, (tlDl, ^ with 
( CT . ( |32] |. ( [33] ). respectively, it is obvious that W^"^'^"^'^ = where d. 



1+X 



-(") 



. rji(n) (n) 

m,fe m,fc' 



(31) 
(32) 

P,n) = 0,?n G M. (33) 



{Wf^-^-'", . . . , W^-^-'"} is the beam-vector set satisfying the 
KKT conditions of the original problem (01). 

In Algorithm 2, each BS m G can achieve the KKT 
solution to ( fTTl i. From Proposition 1, we know that Algorithm 
1 converges to an NE point. Furthermore, line 6 in Algorithm 
1 guarantees that the beam-vector update at each BS cannot 
decrease the total utility. Thus, the total utility at the NE 
point is not smaller than that at the point satisfying the KKT 
conditions of the original problem (01. 

V. Simulation Results 

A. Simulation Setup 

We consider a network with hexagonal cells A^t = 
{!,..., 27} shown in Fig. |2l The distance between adjacent 
BSs is Dbs =2000m. Let Mco = {!,..., M} be the set of 
coordinated BSs, and A4,m = A4t\M.co be the set of unco- 
ordinated BSs. On each sub-channel, Q users are uniformly 
displaced around the serving BS within a circular annulus of 




Fig. 2. The simulated network. 



external and internal radii of D and 0.91?, respectively. Since 
the proposed method is expected to benefit most the cell-edge 
users, by setting D = 1000m as the default value all users are 
around the cell edges. 

The base-band fading channel from the m-th BS to the fc-th 
user on sub-channel n is modeled as 



(") 



200 

^"'m.k , 



3.5 



/(") v,(n) 



(34) 



(") 



is the distance from the m-th BS to the k- 

th user on sub-channel n; 10 logio ^ ^^^^ Gaussian 

random variable with zero mean and a standard deviation 
of 8 accounting for the large scale log-normal shadowing; 
finally, ~ A/'c(Ot, It) is a circularly symmetric complex 
Gaussian random vector accounting for Rayleigh fast fading. 
The total noise power ri^\ at each user is modeled as 



(") 

''lm,k 



E 



200 



3.5 



;(») 



P 



(35) 



where ct^ is the thermal noise power, and the second term 
accounts for the uncoordinated inter-cell interference. Note 
that Tyl"''^, is used to obtain the normalized signal model ([T])- 
We assume equal power (i.e., P^ = P) for each BS m in the 
following simulation, and consider the system the performance 
under different signal-to-noise ratio (SNR), which is defined 
as 7 ^ P/a\ 
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B. Convergence Behavior 

We first illustrate the convergence behavior of the proposed 
distributed beamforming scheme, under three different utility 
functions, namely, 

1) Proportional fairness utility: t^^i(r^^j,) = 

2) a-fairness utiHty: U^^iivl^^^,) = ivl?^^^^ with 
(a = 2); 

3) Sum-rate utility: f/^:i(r^),) = ^ log^ll + r|:i). 
The number of coordinated cells is M = 7; the number of 
sub-channels is = 3; the number of transmit antennas at 
each BS is T = 6; the number of SDMA users is Q = 3; and 
the location parameter of the users is D = 1000m. Algorithm 
1 is initialized by the channel-matched (CM) beamformers. 
Note that initializing with the more sophisticated beamformers, 
such as the in-cell zero-forcing (ICZF) beamformer, may only 
slightly increase the convergence speed. 



M= 7,N=3, T=6, Q=3,D=1000, y=30 
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Fig. 4. Convergence of Algorithms 1 & 2 (o-fairness utility). 
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Fig. 3. Convergence of Algorithms 1 & 2 (proportion fairness utility). pig. 5. Convergence of Algorithms 1 & 2 (weighted sum-rate utility). 



The convergence behaviors of Algorithm 1 and Algorithm 2 
at SNR 7 = 30dB are shown in Figs. [3]|5] for the above three 
utility functions, respectively. In these figures, the squares cor- 
respond to the outer iteration, i.e., the iterations of Algorithm 
1. In each outer iteration, one BS updates its beam- vectors for 
all its users. The solid lines correspond to the inner iterations, 
i.e., the iterations of Algorithm 2. In each inner iteration, 
the BS updates the beam-vectors of one of its users. It is 
seen that the proposed distributed beamforming technique can 
significantly improve upon the initial network utility through 
optimizing the power allocation across the beams and the beam 
directions according to the conditions of the in-cell and out- 
cell interference. Moreover, the value of the network utility 
monotonically increases at each outer iteration, which confirms 
the theoretic result of Proposition 1 . 

Our extensive simulations reveal that the convergence speed 
of Algorithm 1 is affected by the system parameters as follows. 

« The number of coordinated BSs A/: A larger M corre- 
sponds to a slower convergence. 




Fig. 6. 



30 
Y[dB] 

Total proportional fairness utility versus SNR. 
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M=7,N=3,T=6,Q=3.D=1000 

I 




30 

y[dB] 

Fig. 7. Total a-fairaess utility versus SNR. 
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Fig. 8. Total weighted sum-rate utility versus SNR. 



• The SNR 7: A larger 7 corresponds to a slower conver- 
gence. 

• The number of antennas T and the number of co-channel 
users Q: They only slightly affect the convergence of the 
outer iteration, i.e.. Algorithm 1; but significantly affect 
the convergence of the inner iteration, i.e.. Algorithm 
2. Specifically, larger T and Q correspond to slower 
convergence of Algorithm 2. 

In general it is seen that a relatively small number of outer 
iterations are sufficient for Algorithm 1 to converge. Note that 
there is no step-size parameter in Algorithm 1 and the overall 
utility could change dramatically in a single update, leading 
to rapid convergence. This is in contrast to some conventional 
distributed optimization algorithms, in which some step-size 
parameter controls the speed of convergence. 



C. Performance Comparisons 

We next compare the performance of the proposed dis- 
tributed beamforming method with some existing techniques, 
including the simple channel matching (CM) method, the 
in-cell zero-forcing (ICZF) method, the iterative coordinated 
beam-forming (ICBF) method proposed in [131, and the 
more recent maximum SINR interference alignment (MSIA) 
method 121], as well as the approach based on the pure non- 
cooperative game (i.e., without the pricing mechanism) and 
the full-cooperation based method. Furthermore, the sum-rate 
performance of the time-sharing scheme is also considered, 
where the Q users in each cell access each sub-channel via 
TDMA. 

In Figs. |6][8] the total utility values versus the SNR for 
the above-mentioned methods are plotted for the three utility 
functions, respectively. Note that for CM, ICZF and MSIA, the 
beamformer solutions are independent of the utility function. 
It is seen that the proposed distributed beamforming method 
outperform all other techniques in the sense of offering higher 
total network utilities. Moreover, we note that the network 
utility gain provided by Algorithm 1 is affected by the system 
parameters as follows. 

• The distance from the BS to the user D: The network 
utility gain provided by Algorithm 1 is larger when the 
users are closer to the cell edge. Intuitively, the cell- 
edge users experience higher path losses and suffer from 
higher out-cell interference. Through Algorithm 1, both 
the available power across beams and the beam directions 
are optimized to mitigate these effects. On the other hand, 
when the users are close to the serving BS and away from 
the cell-edge, the per-cell optimized solution based on 
the non-cooperative game can achieve high performance 
without any information exchange among the BSs. 

• The number of antennas T and the number of cochannel 
users Q: The network utility gain provided by Algorithm 
1 is larger for larger T and Q. Intuitively, when the 
number of co-channel users Q is large, the users suffer 
from the high out-cell and in-cell interference. Due to 
the large number of antennas. Algorithm 1 has enough 
degrees of freedom to ensure the good quality of service 
of each user while causing minimum amount of in-cell 
and out-cell interference. 

• The number of coordinated BSs M: The performance 
gain provided by Algorithm 1 becomes larger for larger 
M. This is because a larger M corresponds to a larger 
number of degrees of freedom to mitigate the interfer- 
ence. 

• The signal-to-noise ratio 7: Algorithm 1 only provides 
marginal gains at low SNR, while the gain is more 
prominent at high SNR. This is because at high SNR, 
the interference becomes dominant factor for limiting the 
system performance, which can be effectively mitigated 
by Algorithm 1. 

Comparison with non-cooperative game: As expected, the 
approach based on the pure non-cooperative game, denoted by 
Non-coop in Figs. |6][8] yields inferior performance in terms of 
the network utility. The reason is that when each BS optimizes 
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only its own utility function, it does not account for the 
disutility it causes to the users served by other BSs due to 
the interference it generates. In economic terms, a disutility 
of one agent due to the action of another is referred to as a 
negative externality, which is the root of the inefficiencies of 
the non-cooperative game. For large AI, the system becomes 
interference Umited, and the pricing mechanism in Algorithm 
1 can significantly increase the achievable network utility by 
implicitly inducing cooperation and yet maintaining the non- 
cooperative nature of the beamforming solution. 

Comparison with full-cooperation based algorithm: The full- 
cooperation based algorithm has also been simulated. The 
results show that its utility performance is always similar to 
that of our proposed Algorithm 1. Thus, for clarity we did 
not plot its utility performance in Figs. |6]|8] Notice that this 
performance similarity is reasonable: because the optimization 
problem of downlink beamforming in multi-cell OFDMA 
networks is non-convex, even full-cooperation based algorithm 
only achieves the KKT solution. For our proposed Algorithm 
1, the total utility at the NE point is no less than that at the 
point satisfying the KKT conditions of the original problem. 

Comparison with distributed interference alignment: The max- 
imum SINR interference alignment (MSIA) method is an 
extension of the interference alignment algorithm proposed 
in II2TI . where the receive filters are chosen to maximize 
the SINR, and in the meantime to minimize the leakage 
interference at the receivers. For the MISO scenario discussed 
in this paper, interference alignment can be accomplished 
through symbol extension. In the simulations, the number of 
symbol extension 5' is set from 2 to 10, and the degree of 
freedom d for a user's message is set from 1 to its upper 
bound S. In Fig. ©-(HJ, we plot the performance of MSIA 
for the case of the best choices of symbol extension S and the 
degree of freedom d. 

It is seen that the performance of MSIA is inferior to that of 
Algorithm 1, especially at the strong interference scenario. In 
fact, the MSIA is even inferior to the per-cell optimized non- 
cooperative game solution in some scenarios. The reasons are 
as follows. 

• The MSIA only optimizes the beam directions. In con- 
trast. Algorithm 1 optimizes both the beam directions and 
the power distribution across the beams. 

• The MSIA is designed to maximize the SINR and in 
the meantime to minimize the leakage interference. In 
contrast. Algorithm 1 is designed to optimize a general 
utility function. 

• In MSIA, the iterative algorithm alternates between the 
original and reciprocal networks. Within each network, 
the receivers update their interference suppression filters. 
In contrast. Algorithm 1 is iteratively implemented only at 
the transmitters (BSs) and it can converge only in a small 
number of iterations. Thus, Algorithm 1 incurs a much 
lower information exchange overhead, in comparison 
with MSIA. 



VI. Conclusions 

We have considered the downlink beamforming problem 
for co-channel interference mitigation in multi-cell OFDMA 
networks. The problem is formulated as a general utility max- 
imization problem subject to the per-cell power constraints, 
which is non-convex. We have proposed a distributed solution 
based on the non-cooperative game with pricing mechanism. 
We have shown that for some popular utility functions, such as 
the weighted sum rate utility, the proportional fairness utility, 
and the a-fairness utility, the proposed algorithm converges 
to a Nash equilibrium point. Moreover, we have developed an 
efficient algorithm to solve the KKT condition at each base 
station based on the dual decomposition technique. We have 
provided extensive simulation results to illustrate that the pro- 
posed method can converge to a Nash equilibrium in a small 
number of iterations, and it outperforms several state-of-the-art 
approaches to multicell interference mitigation, including the 
recently developed distributed interference alignment method. 

In this paper, we have assumed the perfect instantaneous 
channel state information. From the practical point of view, 
the following issues remain to be investigated in future: (1) the 
impact of the reduced information exchange; (2) the robustness 
of the proposed method in the presence of transmission delay, 
packet loss and estimation error. 

Appendix A: Proof of Proposition 1 
After some simple manipulations, we have0 

2 



''m , k 



m.k 



m.k^ m.k 



(1 



(36) 

By the assumptions on the utility function ^, we have 
<fc)'>0' and < Atl:;, < 2. Thus 



d t 



> 0. 



(37) 



Hence ul^j, is a convex function of I^ 'j,. Then we have 



c/i"i(w("))>c/i"i(w("))+ "^'^ 



m,k ^ 
An) 



(n) 



An) 
-m.k 



(38) 

where and I,'^ denote the interferences at the current 
operating point W = {W("\n G TV} and at any new 
operating point W = {W("\ n e TV}, respectively. 

Summing up (l38T l over all users served by BS m and over 
all sub-channels, we have (|39] |. 

Hereafter we will drop the explicit dependency of tt^ f. on 

'^Here, we drop the explicit dependency of f/^'], on T^^'j,, and denote 
t/<"i(r("' (W("))) as I/'"' (WM). 
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E 



E 



c/i:i(w(")) > 



E 



E 



(39) 



Assume that BS m applies Algorithm 1 to update its beam- 
vectors, given the current beam-vectors W = {W("\ n G 
M}. According to the condition of STEP 6 in Algorithm 1, 
we have 

C/™(W^, W_„,) > C7™(W„, W_,„), (40) 

where W = {Wi, • • • , W„,_i, W,„, W,„+i, • • • , Wm} de- 
notes the operating point after BS m updates its beam-vectors 



Plugging (|6]l-(fT0ll into ( |40l l. we have (1411 ). 
Because of BS m's beam-vector update, the received inter- 
ference by user u G B''-^\i G M\m is changed from x'"j to 



l]"^, and 



E (I 



-(") _ (") 

1 W f 



(„) 

1 W 7 



keB), 



Thus, we have ( |43T l. 

Adding the constant terms ^ 12 J2 1 

to both sides of dTO . we have ( l44l l. 



(42) 



Subtracting the terms 



(") 



from both sides 



E E E E ^ 

(i, u)^( m,k) 

of ( |44] |. and using ( |43] l, we have ( |45] l. 

Summing up ( |39] l over j G yV4\TO at the updated operating 
point W, we have 



E 



E E 



in)) 



^EE E 



Adding E ^/miCW")) to both sides of we 



have ( |47] l. 

Combining (|45] | and (l47T i yields 

t/;:(wH)> E E E C(w("^)- 



EE E 



C/„ctwork(W) 



ork(W) 



(48) 

Hence, when BS m adjusts its beam-vectors, the total utility 
cannot decrease. Since at most one BS updates its beam- 
vectors at anytime, the total utility is non-decreasing in each 
iteration. As both the number of players and the size of the 
strategy sets are finite, the total utility is bounded. Thus, the 
total utility will convergence. 

Now, we assume that Algorithm 1 converges to a fixed point 
W* = (W^;, . . . , W^^). If W* is not an NE point, then there 



exists W = (W^, . . . , W„j, . . . , W 



M/ 



such that 



(49) 



Applying the similar deduction in (|40]i-(|48]l, we have 

C/„etwork(W) > C/„etwork(W*), (50) 

which contradicts the assumption that W* is a fixed point. 
Thus W* is an NE point. □ 

Appendix B: Proof of Proposition 2 

The proof is along the similar line of that for Proposition 
1 in ifTSl . There are two cases for the solution to the KKT 
conditions of ( fTTI l. 
Case 1: A™ > and w|,"\^ ^ 0. 



Notice that T, 
to have 



(») 



is a positive-definite matrix. Thus, it is easy 



(51) 



Obviously, dSB and (E) imply that h^^^^^w^^^. ^ and 

"■m,k °^ ^mM^rn,k- 

KKT conditions of dTTI l must be of the form 



'^ni.k 



In) 



rp(«)t, (n) 



(52) 



and w,(,':i ^ 0. 



Case 2: A„ 

It is easy to see that ( 1211 1 is satisfied only if one of the following 
two conditions holds: 



in) 



(a) h: 

(b) hL"i. e ^.(L, 



and T w*") 

(«) N 
?7i . A; / ■ 



0; 



If (a) holds, , 

' ' m,/c m,k 



implies that the non-zero 

(n) ^fn) 

beam-vector is orthogonal to the channel vector 

in) 

In this case, user k G Bm cannot receive any information 
(46) from the serving base station. Hence we discard the solutions 
hL"iw™.i = and L^^'U^^^^ = for the case A™ ^ and 



w 



(n) 



^0. 



If (b) holds and A.,„ = 0, a non-zero beam-vector which 
satisfies (l2Tl i must be of the form 



(n)* f (n)ti (n) 



(53) 



Notice that (|53] l is equivalent to ( |52] i with A^ = 0. Meanwhile, 
notice that multiplying w|j"''j* by any unit-norm complex 
number does not affect either the objective function or the 
power constraint in ( fTTI l. Hence, for the two cases discussed 
above, we can set the unique solution to the KKT conditions 
of dnli as 

(54) 



_ o{n) rp(n)ti (n) 
m.k i-^m,k m,k m,k^ 



where I3[^\ is some scalar constant. 

To determine we plug ( l54l l into ( I2l1 l to obtain 



(") 



„(„) „(„)t^,(.^) 



l+I^ 



rp(n) o{n) rp(n)ti^(n) 

7n^k^ ra^k m,k m^k 



(55) 



12 



E E (c. - E E -r: 

^ E E (C (w("0 - E E -J: 

0»7^(m,fe) 



-^(n) _ (n) 

n w I 



,-^(") (") 

n w I 



(41) 



E E E -?ift:^-4:0 = E E E E -?i(|hL"l<lf ^^-^^^^ 
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= E E <fe(^^"V E E E t^r«(w("VE E E E 

"e-^ feeB$?' ^'■e-^ «ee<"' 

(i,«)^(m,fc) 

^ E E E f^;:(w("^)-E E E E ^ ' 
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n w I 
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E E C.(w("0+ E E E f^;:(w^"^)-E E E E 

(j,u)/(m,/c) 

E E <feM+ E E E ^rJ(w("^) - E E E 
E E t^SM + E E E K«(^'"')-^-(C-^:)]>EE E t^i.:^(w'"^)- 



n w ) 

m.u m.k 



jeA^ neAA^ggi") 



(45) 



EE E t^rj(w("^)>E E <fe(w("')+E E E KJ(w("^)--r:(^:-^-:); 



jeM neAA„gg 



(n) 



jeM neA/-„eg(") 



(47) 



Considering j. 
have 



',fe = Cfel' and ^ 0, we Thus, we obtain 



U. 



■n) ( 



|u(") rpCnjti^C") |2 

l"m,fef^TO,fe m,fe "m,fc I 



1+X 



(») 



i + x 



rp(")tv,(") 

m,fe m,fe m,fc 



which is equivalent to 



Pm,k 
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;(") rT,(n)tu(") 
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[1] 



[2] 
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m.k 



(n) ^(n) 
m,fc m,fc* 
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□ 
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